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1 Linearity testing and Arrow's theorem 

Monday, 27th February 2012 

• Open Problem [Guy86, HK92]: Let a G M n with ||a|| 2 = 1. Prove Pr xe{ _! 1} „ [| (a, x)\ < 
1]>|- 

• Open Problem (S. Srinivasan): Suppose g : {— 1, l} n — > l] where g(x) G [|, l] if 
£?=i*i>f and 5 (x)G [-1,-|] ifE? = iXi<-f. Prove deg(/) = fi(n). 

In this workshop we will study the analysis of boolean functions and its applications to 
topics such as property testing, voting, pseudorandomness, Gaussian geometry and the 
hardness of approximation. Two recurring themes that we will see throughout the week 
are: 

• The noisy hypercube graph is a small set expander. 

• Every boolean function has a "junta part" and a "Gaussian part". 

1.1 The Fourier expansion 

Broadly speaking, the analysis of boolean functions is concerned with properties of boolean 
functions / : { — 1,1}™ —> {—1,1} viewed as multilinear polynomials over R. Consider the 
majority function over 3 variables MAJ3(x) = sgn(xi + X2 + £3). It is easy to check that 
MAJ3(x) = 2 x l + \ x i + ^ x 3 ~ ^ x i x 2 x 3, and this can be derived by summing 2 3 = 8 
polynomials p y : { — 1, l} 3 — > {—1,0, 1}, one for each y G {—1, l} 3 , where p y (x) takes value 
MAJ3(x) when y = x and otherwise. For example, 

*><-,,!,-!, (x) = (i^i) (±±2) (1^2) ■ MAJ 3 (-i, 1, -1). 

Note that the final polynomial that results from expanding and simplifying the sum of 
PyS is indeed always multilinear (i.e. no variable Xi occurs squared, or cubed, etc.) since 
xf = 1 for bits Xi G { — 1, 1}. The same interpolation procedure can be carried out for any 
/:{-l,l}»->R: 

Theorem 1 (Fourier expansion) Every f : { — 1, l} n — > R can be uniquely expressed as 
a multilinear polynomial R, 

f( x ) = c S±\ x i> where each cs G R. 
SC[n] ieS 

We will write f(S) to denote the coefficient cs and xs( x ) f or the function n«e5 Xi ' an( ^ ca ^ 
f( x ) = J2sc[n] f(S)xs( x ) the Fourier expansion of f. We adopt the convention that X0 = 1; 
the identically 1 function. We will write deg(/) to denote max5 C [ ra ]{|5| : f(S) ^ 0}, and 
call this quantity the Fourier degree of f. 
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We will sometimes refer to xs( x ) '■ {~ 1> 1}™ ~~ * { — 1> 1} as the "parity-on-5" function, since 
it takes value 1 if there are an even number of —1 coordinates in x and —1 otherwise. 
Using the notation of Theorem 1, we have that MAJ^({1}) = ±, MAJ^({1, 2, 3}) = 

MAJ^({1,2}) = 0, and deg(MAJ 3 ) = 3. 

We have already seen that every function / : {—1, 1}™ — > M. can be expressed as a multilinear 
polynomial over IR (via the interpolation procedure described for MAJ3); to complete the 
proof of Theorem 1 it remains to show uniqueness. Let V be the vector space of all functions 
/ : {— 1, l} n — > K. Here we are viewing / as a 2 n -dimensional vector in IR 2 ™, with each 
coordinate being the value of / on some input x G {—1, l} n ; if / is a boolean function this 
is simply the truth table of /. Note that the parity functions xs( x ) are ah elements of V, 
and furthermore every / G V can be expressed as a linear combination of them (i.e. the 
Xs( x )' s are a spanning set for V). Since there are 2 n parity functions and dim(l/) = 2 n , 
it follows that {xs ■ S C [n]} is a basis for V, and this establishes the uniqueness of the 
Fourier expansion. 

Definition 2 (inner product) Let f,g : {—1, l} n — > {—1, 1}- We define the inner prod- 
uct between f and g as 

Note that this is simply the dot product between / and g viewed as vectors in M. 2 " , normal- 
ized by a factor of 2~ n . Given this definition, every boolean function / : { — 1, l} n — > {—1, 1} 
is a unit vector in M 2 ™ since (/, /) = 1. We will also write ||/|| 2 to denote (/, /), and more 
generally, ||/|| p := E[\f(x)\P]VP. ' 

Theorem 3 (orthonormality) The set of parity functions {xs( x ) '■ S C [n]} is an or- 
thonormal basis for R 2 " . That is, for every S,T C [n] , 



(xs,Xt) 



1 if S = T 
otherwise. 



■ a; 2 



Proof. First note that xs ■ XT = Xsat since ]J ieS Xi]J jeT xj = Hies AT x * UjeSnT '-3 
YlieSAT x i' wn ere the final equality uses the fact that xf = 1 for X{ G {—1, 1}- Next, we 
claim that 

'1 if [7 = 
otherwise. 



u\ 



noting that this implies the theorem since SAT = iff S = T. Recall that we have defined 
X% to be the identically 1 function, and if U ^ then exactly half the inputs x G { — 1, l} n 
have xu( x ) = 1 an d the other half xu( x ) = — 1- B 



Proposition 4 (Fourier coefficient) Let f : {-l,l} n -»■ R. Then f(S) = (f,xs) = 
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Proof. To see that this holds, we check that 

(f,Xs) = ( E f(T) XT ,Xs)= £ f(T)-( X T,Xs) = f(S). 



T -N TC[n] 

Here we have used the Fourier expansion of / for the first equality, linearity of the inner 
product for the second, and orthonormality of parity functions (Theorem 3) for the last. ■ 

Next we have PlanchereFs theorem, which states that the inner product of / and g is 
precisely the dot product of their vectors of Fourier coefficients. 

Theorem 5 (Plancherel) Let f,g: {-1, 1}" -> R. Then, (f,g) = Esc W f(S)g(S). 
Proof. Again we use the Fourier expansions of / and g to check that 

(f,g) = ( E f(S)xs, E g(T)xT) = ^Y,f^9(T)-(xs,XT)= £ Hs)g(S). 

X SC[n] TC[n] 1 g T SQn] 

The second equality holds by linearity of inner product, and the last by orthonormality. ■ 

An important corollary of Plancherel's theorem is Parseval's identity: if / : { — 1, 1}" — > R, 
then H/H2 = (/, /) = Esc[n] f(S) 2 (*- e - the Fourier transform preserves L2-norm). In 
particular, if / is a boolean function then Esc[n] f(^) 2 = E[/(x) 2 ] = 1, which we may view 
as a probability distribution over the 2 n possible subsets S of [n]. Note that if / and g are 
boolean functions then f(x)g{x) = 1 iff f(x) = g(x), and so E[f(x)g(x)] = 1 — 2- dist(/, g), 
where dist(/, g) = Pr[/(x) 7^ g(x)] is the normalized Hamming distance between / and g. 

One of the advantages of analyzing / via its Fourier expansion is that this polynomial 
encodes a lot of combinatorial information about /, and these combinatorial quantities can 
be "read off" its Fourier coefficients easily. We give two basic examples now. Recall that 
for functions / : {—1, l} n —> R, the mean of / is E[/(x)] and its variance is Var(/) := 
E[/(x) 2 ] — E[/(x)] 2 . Note that if / is a boolean function then E[/(x)] measures the bias of 
/ towards 1 or —1, and Var(/) = 4 • Pr[/(x) = 1] • Pr[/(i) = —1]. If / has mean and 
variance 1 we say that / is balanced, or unbiased. 

Proposition 6 (expectation and variance) Let f : { — l,l} n — > R. Then E[/(x)] = 
/(0), andV aL r(f) = Z s ^f(S) 2 . 

Proof. For the first equality, we check that /(0) = E[f(x)x$(x)] = E[/(x)]. The second 
equality holds because 



Var(/) = E[f(xY) - E[/(^ = ( E W ) - WY = £ f(S) J . 



SQn] 



Here the second equality uses an application of Parseval's identity. ■ 

It is nice to think of f(S) 2 as the "weight" of / on S, with the sum of weights of / on all 
2 n subsets S of [n] being 1 by Parseval's. Often it will also be convenient to stratify these 
weights according to the cardinality of the set S. 
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Definition 7 (level weights) Let f : { — 1, 1}™ — > R and k E {0, . . . , n}. TTie weight of f 
at level k, or the degree-k weight of f , is defined to be W fe (/) := J2\s\=k f($) 2 ■ 

For example, in this notation we have W°(MAJ 3 ) = W 2 (MAJ 3 ) = and W 1 (MAJ 3 ) = 
W 3 (MAJ 3 ) = 1/2. 

1.1.1 Density functions and convolutions 

So far we have been viewing domain { — l,l} n of our functions simply as strings of bits 
represented by real numbers ±1. Often we would like to be able to "add" two inputs, 
in which case we will view our functions as / : FJ? — > R instead. The mapping from F2 
to {-1,1} is given by (-l) b , sending E F 2 to 1 E R and 1 E F 2 to -1 e R. We 
will sometimes also associate a boolean function Fg — > {—1, 1} with its corresponding F 2 
polynomial F'J? —> F 2 . For example, the different representations of the parity function xs 
are given in Table 1. 



Xs (x) : ^ {-1,1} 


X H-> YiieS X i 


XS (x) :F2->{-l,l} 




X s(x) :F 2 l ^F 2 





Table 1: Different representations of xs(x) 

The ¥2 degree of a boolean function / : FJ? — > {—1, 1}, denoted deg Fa (/), is its degree as 
an ¥2 polynomial Fg — > F 2 . For example, the parity functions are degree-1 F 2 polynomials; 
in contrast, recall that deg(xs'), the Fourier degree of xs> 1S In general we have the 
inequality deg Fa (/) < deg(/) for all boolean functions / : Fj — > {—1, 1}- We remark that 
unlike Fourier degree, it is not known how to infer the F 2 degree of a boolean function from 
its Fourier expansion. The following useful fact can be easily verified: 

Fact 8 Let X s ■ F£ -> {-1, 1}. Then X s{x + y) = Xs{x) ■ Xs(y)- 

Definition 9 (probability density function) tp : ¥% — > R- is a probability density 
function of B xeF n [<p(x)] = 1. 

Note that a probability density function 93 : F2 — > R— corresponds to the probability 
distribution over Fg where Pr[x] = (p(x) ■ 2 n . For example, the constant function (p = 1 
corresponds to the uniform distribution over F?;. For any a £ Fj, the density function (p a (x) 
that takes value 2 n if x = a and otherwise corresponds to the distribution that puts all 
its weight on a single point a E F^. 

Definition 10 (convolution) Let f, g : ¥% — > R. The convolution of f and g is the 

function f*g:¥' 2 l ->R defined by (f * g)(x) := ~E y& n[f(y)g(x + y)]. 



G 



Note that (/ * g)(x) = (g * f)(x), since (y, y + x) is just a uniformly random pair of inputs 
with distance x and therefore has the same distribution as (y + x,y). Similarly it can be 
checked that the convolution operator is commutative: (f*g)*h = f*(g*h). The following 
facts also follow easily from definitions: 



Fact 11 Let f: R 



and(p2,tfi2 be density functions. Then 



1. (cp,f)=E y ^[f(y)}. 

2. (<p*f)(x)=E y ^[f(x + y)}. 

3. The density for z = y\ + y2, where y\ ~ tpi and y2 ~ (f2, is (fi * y>2- 

Theorem 12 (Fourier coefficients of convolutions) Let f^g : F?> — > 1 

f(S)-g(S). 



Then f * g(S) 



Proof. We check that 

f7g(S) = E[(f*g)(x)xs(x)] 



E 



E 

y 

E 

y 

E 

y 



W(y)9(x + y)} -xs{x) 

y 

f(y)-E[g(x + y)xs(x)} 

X 

f(y)-E[g(z) X s(z + y) 

z 

f(y)-E[g(z)xs(z)xs(y)} 



(1) 
(2) 



= E[f(y) X s(y)]-ng(z)xs(z)] = f(S)-g(S). 

y 

Here (1) uses the fact that z — y = z + y for y,z 6 Fg, and (2) is an application of Fact 8. ■ 
Theorem 13 Let f,g,h:W%-> R. Then (f * g,h) = (f, g*h). 

Proof. By Theorem 12 and Plancherel, both sides of the identity equal J2sc\n] f(S)g(S)h(S). 



1.2 Blum-Luby-Rubinfeld 

We begin by considering two notions of what it means for a function / : — > F2 to be 
linear. 

Definition 14 (linear #1) A boolean function f : ¥ T 2 l — > ¥2 is linear if f(x + y) = f(x) + 
f(y) for all x,y G Fg. 

Definition 15 (linear #2) A boolean function f : F% — > F2 is linear if there exists a%, . . . , a n G 
F2 such that f{x) = a\X\ + . . . + a n x n . Equivalently, there exists some S C [n] such that 

f( x ) = SieS Xi - 
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Proposition 16 (#1 



#2) These two definitions are equivalent. 



Proof. Suppose / satisfies f(x + y) = fix) + f(y) for all x,y G F™. Let a, = /(ej) G F2, 
where e« is the i-th canonical basis vector for It follows that f{x) = f '(Y17=i x i e i) = 
Sr=i x */( e *) = SILi *^*' wn ere the second equality uses Definition 14 repeatedly, along 
with the fact that f(xiei) = Xif(ei). For the reverse implication, note that f{x + y) = 

Sigs( x + y)i = J2ies x i + YliesUi = f ( x ) + /(^)> w h ere the first and final equalities uses 
Definition 15. ■ 

It is natural to consider analogous notions for approximate linearity. 

Definition 17 (approximately linear #1) A boolean function f : F?J — > F2 is approxi- 
mately linear if f(x + y) = f{x) + f(y) for most pairs x, y G Fg . 

Definition 18 (approximately linear #2) ^4 boolean function f : — > ¥2 is approxi- 
mately linear if there exists some S C [n] such that f(x) = X^eS Xi f or m °st x £¥2- Equiv- 
alently, there exists an S C [n] such that f is close in Hamming distance to g{x) = Y^ies Xi - 

A straightforward generalization of argument given in the proof of Proposition 16 shows that 
Definition 18 (approximately linear #2) implies Definition 17 (approximately linear #1). 
However, the argument for the reverse implication no longer holds. We will adopt Definition 
18 as our notion of approximate linearity for now, and we will see that the linearity test 
of Blum, Luby, and Rubinfeld [BLR93] implies that both definitions are in fact equivalent. 
The Fourier-analytic proof we present here is due to Bellare et. al [BCH + 96]. 

Definition 19 (BLR linearity test) Given blackbox access to a function f : F?? — > ¥2, 

1. Pick x,y G FJ? independently and uniformly. 

2. Query f on x,y and x + y. 



Theorem 20 (soundness of BLR) //Pr[BLR accepts /] > 1 — e then f is e-close to being 
linear (in the sense of Definition 18). 

Proof. It will be convenient to think of / as F?j — > {—1, 1}, and so the acceptance criterion 
(i.e. step 3) becomes f{x)f(y) = f(x + y). Viewing / this way, now note that 



3. Accept iff f(x) + f(y) = f(x + y). 



Pr[BLR accepts /] 



E [l(f(x)-f(y) = f(x + y)] 

x,y L J 
E [l 2 + l 2 .f( X )f(y)f( X + y)] 

i + |E[/(x)-(/*/)(x)] 




(3) 



SC[n] 




(4) 



SC[n] 
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Here (3) uses Parseval's identity and (4) uses Theorem 12. Therefore, if Pr[BLR accepts /] > 
1_ E then l-2e < £ s <- [n] f(S) 3 < max{/(S)}.£ sg[n] />) 2 = max{/(5)}, or equivalently, 
there exists an S* C [n] such that /(5*) > 1 - 2e. Since /(5*) = E[/(ar)x s .(x)] = 
1 — 2 • dist(/, xs*)j we have shown that dist(/, xs*) < e and the proof is complete. ■ 

Theorem 20 says that if Pr[BLR accepts /] > 1 — e then / is e-close to some linear function 
Xs*'i however, we do not know which of the 2 n possible linear functions xs* this is. The 
following theorem tells us that we can nevertheless obtain the correct value of xs* ( x ) with 
high probability for all xSFJ. 

Theorem 21 (local decodability of linear functions) Let f : — > F 2 be e-close to 
some linear function xs*, & n d let x 6 The following algorithm outputs XS*( X ) with 
probability at least 1 — 2e: 

1. Pick y 6 F 2 uniformly. 

2. Output f(y) + f(x + y). 

Proof. Since x and x + y are both uniform (though not independent), with probability at 
least 1 — 2e we have f(y) = xs* (y) and f{x + y) = xs* i x + y)- The claim follows by noting 
that xs* (y) + Xs* ( x + y) = Xs* (y + ( x + ?/)) = X5* where we have used the linearity of 
Xs* along with the fact that x + y = x — y for x,y ■ 

1.3 Voting and influence 

• Puzzle: Is it possible for / : { — 1, l} n —> {—1,1} to have exactly k non-zero Fourier 
coefficients, for k = 0,1,2,3,4,5,6,7? Classify all functions with 2 non-zero Fourier 
coefficients. 

• Puzzle: Find all / : {-1, l} n -> {-1, 1} with W x (/) = 1. 

We may think of a boolean function / : {—1,1}™ — > { — 1,1} as a voting scheme for an 
election with 2 candidates (±1) and n voters (x±, . . . , x n ). Many boolean functions are 
named after the voting schemes they correspond to: the i-th dictator DICTj(a;) = X{ (i.e. 
DICTj = Xi)i A: -juntas (functions that depend only on k of its n variables, where we think 
of k as <C n, or even a constant); the majority function MAJ(x) = sgn(xi + . . . + x n ). The 
majority function is special instance of linear threshold functions f(x) = sgn(ao + a\X\ + 
. . . + a n x n ), a,i £ R, also known as weighted-majority functions, or halfspaces. Another 
important voting scheme in boolean function analysis is TRIBES WjS : { — 1, 1} WS — >■ { — 1, 1}, 
the s-way OR of w-w&y AND's of disjoint sets of variables (where we think of —1 as true 
and 1 as false). In TRIBES„, jS , the candidate —1 is elected iff at least one member of each 
of the s disjoint tribes of w members votes for —1. 

The following are a few reasonable properties one may expect of a voting scheme: 

• Monotone: if x« < yi for all i £ [n] then f(x) < f(y). 

• Symmetric: f(i:(x)) = f(x) for all permutations n G S n and x S {—1, l} ra . 
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• Transitive-symmetric (weaker than symmetric): for all i,j G [n] there exists a permu- 
tation 7r G S n such that ir(i) = j and /(x) = f(ir(x)) for all x G { — 1, l} n - 

Later in this section (for the proof of Arrow's theorem) we will also assume that voters vote 
independently and uniformly; this is known as the impartial culture assumption in social 
choice theory. 

Definition 22 (influence) Let f : { — l,l} n — > { — 1,1}. We say that variable i G [n] is 
pivotal for x G {—1, l} n if fix) ^ f(x® 1 ), where x® 1 is the string x with its i-th bit flipped. 
The influence of variable i on f , denoted Infj(/), is the fraction of inputs for which i is 
pivotal. That is, := Pr[/(x) + f(x®% 

For example, Infj(DICTj) is 1 if i = j and otherwise. For the majority function over 
an odd number n of variables, Infj(MAJ) = Q n "7)/2) ' 2~( n_1 ) since voter i is pivotal iff 
the votes are split evenly among the other n — 1 voters. By Stirling's approximation, this 
quantity is ~ %j2/irn = @(l/y/n). 

Definition 23 (derivative) Let f : {—1, 1}™ — > K. The i-th derivative of f is the function 
(Dif)(x) := ^(/(x*^ 1 ) — f(x li 1 )), where x l ^ b is the string x with its i-th bit set to b. 

Note that if / is a boolean function then (D{f)(x) = ±1 if i is pivotal for / at x, and oth- 
erwise, and therefore E[(Z?j/)(x) 2 ] = Infj(/). We will adopt E[(Z)j/)(x) 2 ] as the generalized 
definition of the influence of variable i on / for real-valued functions / : {—1, l} n — > R. 

Theorem 24 (Fourier expressions for derivatives and influence) Let f : {—1, l} n — > 
R. Then 

1. (D i f)(x) = J2 S3i f(S)xs\i(x). 

2. Inf i (/) = E59i/>) 2 - 

Proof. The first identity holds by noting that (DiXs)(x) = Xs\i( x ) if « G 5 and otherwise, 
along with the fact that Di is a linear operator, i.e. Di(af + g) = a(Dif) + (Dig). The 
second identity then follows by applying Parseval to Infj(/) = E[(Dj/)(x) 2 ]. ■ 



Proposition 25 (influence of monotone functions) Let f : { — 1,1}" — > {—1,1} be a 
monotone function. Then Infj(/) = f(i). 

Proof. If /is monotone then (A/)(x) G {0,1} andsoInf,(/) = E[(A/)(x) 2 ] = E[(Dif)(x)] = 
Dififb) = f(i)- Here the final equality uses the Fourier expansion of D^f given by Theorem 
24. ■ 

An immediate corollary of Proposition 25 is that monotone, transitive-symmetric func- 
tions / have Infj(/) < 1/y/n for all i £ [n]. This follows from the fact that transitive- 
symmetric functions satisfy f(i) = f(j) for all i,j G [n], along with the bound Y17=l /(*) 2 — 

Effcw /(S) 2 = 1- 
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Definition 26 (total influence) Let f : {— l,l} n — > E. The total influence of f is 

>:r ,inr,(/;. 

If / is a boolean function, then 

J\ J\ r « 

inf (/) = y, pr[/(*) / /(^ ffil )] = £ vmm + n^))} = e e ^ /(^)) 



1=1 i=l 



i=l 



The quantity Ei=i ^-(f( x ) 7^ f( x ® 1 )) is known as the sensitivity of / at x, and so the total 
influence of a boolean function is also known as its average sensitivity. If / is viewed as a 
2-coloring of the boolean hypercube, the total influence can also be seen to be equal to n 
times the fraction of bichromatic edges. 

If / is a monotone boolean function, we see that Inf(/) = Y27=i /(*) = Y17=i ^[f{ x ) x i] = 
Ei[f(x)(xi + . . . + x n )]. Recall that if / is boolean then f(x)(x\ + . . . + x n ) = 1 if f(x) = 
sgn(xi + . . . + x n ) and —1 otherwise. Therefore the total influence a monotone boolean 
function, when viewed as a voting scheme, measures the expected difference between the 
number of voters whose vote agrees with the outcome of the election and the number whose 
vote disagrees. It is reasonable to expect this quantity to large, and the next proposition 
states that (if n is odd) it is maximized by the majority function. 

Proposition 27 (MAJ maximizes sum of linear coefficients) Let n be odd. Among 
all boolean functions f : { — 1,1}" — > {—1,1}, the quantity EILi/W ^ s maximized by 
MAJ(x) = sgn(xi + . . . + x n ). Consequently, if f is monotone then Inf(/) < Inf(MAJ) ~ 

Proof. Note that Ya=i /(») = E [f( x )( x i + ■■■ + x n )] < E[\ Xl + ... + x n \] since / is (±1)- 
valued, where the inequality is tight iff f{x) = sgn(xi + . . . +x n ) = MAJ(x). For the second 
claim recall that Infj(/) = f(i) if / is monotone (Proposition 25), and Infj(MAJ) ~ 
for all i 6 In]. ■ 



Proposition 28 (Fourier expression for total influence) Let f : {—1, 1}" — > K. Then 

Mi(/) = E S c N \s\ ■ f{s? = ELi k • wfc (/)- 

The proof of this proposition follows immediately from the Fourier expression for variable 
influence given by Theorem 24. Notice that each Fourier coefficient is weighted by its 
cardinality in the sum, and so total influence may also be viewed as a measure of the 
"average degree" of /'s Fourier expansion. 

Recall that for functions / : {—1,1}™ — > R we have Var(/) = Es^0 f(S) 2 (Proposi- 
tion 6), and comparing this quantity with the Fourier expression for total influence yields 
Var(/) < Inf(/), the Poincare inequality for the boolean hypercube. The inequality 
is tight iff W 1 (/) = 1, which for boolean functions implies that / = ±DICTj(/) for 
some i E [n]. If / is a boolean function and p := Pr[/(x) = 1], it is easy to check 
that Var(p) = 4p(l — p), and so the Poincare inequality can be equivalently stated as 
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4p(l — p) < Inf (/) = n ■ (fraction of trichromatic edges). The Poincare inequality is there- 
fore an edge-isoperimetric inequality for the boolean hypercube (where we view boolean 
functions as indicators of subsets of {—1, l} ra ): for any p, it gives a lower bound on the 
number of boundary edges between A and A where A C {— 1, l} n has density p. This is 
a sharp bound when p = 1/2, but not when p is small. For smaller densities we have the 
bound 2alog 2 (l/a) < Inf(/), where a := min{Pr[/(x) = l],Pr[/(x) = —1]}. This in turn 
is sharp whenever a = 2 k , achieved by the AND of k coordinates. 



1.4 Noise stability and Arrow's theorem 

Let p G [0,1] and fix x G {—1,1}". Let N p (x) be the distribution on { — 1, l} n where 
y ~ N p (x) if for all i G [n], yi = Xi with probability p, and yi is uniformly random ±1 with 
probability 1 — p. More generally, for p G [—1,1], we have that N p (x) is the distribution on 
strings y where 

J Xi with probability \ + \p 
^ \ —Xi with probability ^ — -^P- 

If x ~ { — 1, 1}™ is uniformly random and y ~ N p (x), we say that x and y are p-correlated 
strings; equivalently, x and y are /9-correlated if they are both uniformly random and 
E[xjyj] = p for all i G [n]. 

Definition 29 (noise stability) Zei / : {—1, l} n — >■ R and/3 G [—1, 1]. The noise stability 
of f at noise rate p is 

Stabp(/) := E[/(x)/(y)], where x,y are p-correlated strings. 

For example Stab p (±l) = 1, Stab p (DICTj) = p, and Stab p (xs) = p' 5 '. Tomorrow we will 
prove Sheppard's formula: lim^oo Stab p (MAJ) = 1 — ^ arccos(/j). In particular, if p = 1 — 5 
we have Stab p (MAJ) = Q(y/S). 

Definition 30 (noise operator) Let p G [—1,1]. The noise operator T p on functions 
f : {-1, 1}™ -> K acte as follows: (T p f){x) := E y ^ Np{x) [f(y)}. 

Proposition 31 (Fourier expressions for noise operator and stability) Let p G [—1,1] 
and f : {-l,l} n -> R. T/ien 

L (T p f)(x) = j:sc [ n]P lSl f(S)xs(x). 
2. Stab p (/) = E5c W P |S| /(^) 2 - 

Proof. The first identity follows from the linearity of the noise operator, along with the 
observation that (T p xs)(x) = p^Xsix)- The second holds by noting that 

Stab p (/) = E [f(x)f(y)) = B[f(x)(T p f)(x)] = £ /(5)3>?(S) = £ P |5| /(S) 2 - 

(*^ corr * 5 C[n] SC[„] 
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Suppose there is an election with n voters and three candidates: A, B and C. Each voter 
ranks the candidates by submitting three bits indicating her preferences: whether the prefers 
A to B (say, —1 if so and 1 otherwise), and similarly for B versus C and C versus A. Clearly 
a rational voter cannot simultaneously prefer A to B, B to C and C to A] her ordering of 
the candidates must be non-cyclic. 

Definition 32 (rational) A triple (a,b,c) G { — 1,1} 3 is rational if not all three bits are 
equal (i.e., (a, b, c) defines a total ordering, and is a valid preference profile). We define the 
function NAE : { — 1, l} 3 — > {1, 0} to be 1 iff not all three bits are equal. 

Now suppose the preferences of the n voters are aggregated into three n-bit strings x, y and z, 
and the aggregate preference of the electorate is represented by the triple (f(x), f(y), f{z)) 
for some boolean function / : {—1, l} n — > { — 1, 1}- Clearly we would like for the the outcome 
of the election to be rational; that is, N AE( f (x), f(y), f(z)) = 1. 

Fact 33 (Condorcet's paradox [Con85]) With MAJ as the aggregating function it is 
possible that all voters submit rational preferences and yet the aggregated preference string 
is irrational. 





Vl 


V2 


^3 


MAJ 


A> B? 


+1 


+ 1 


-1 


+1 


B>C1 


+1 


-1 


+ 1 


+1 


C > A? 


-1 


+ 1 


+ 1 


+1 



Figure 1: An instance of Condorcet's paradox 

Theorem 34 (Arrow's impossibility theorem [Arr50]) Suppose f is an aggregating 
function that always produces a rational outcome if all voters vote rationally. Then f = 
±DICTj for some i £ [n]. If f is further restricted to be unanimous (i.e. /(I, . . . , 1) = 1 and 
/(— 1, . . . , —1) = —1; certainly a very reasonable assumption) then f must be a dictator. 

The main result of this section is a robust version of Arrow's impossibility theory due 
Gil Kalai [Kal02]. It expresses the probability that an aggregating function / produces a 
rational outcome in terms of the noise stability of /, under the impartial culture assumption 
(each voter selects an NAE-triple (xi,yi,Z{) uniformly and independently). 

Theorem 35 (Kalai) E[NAE(/(s), f(y), f(z))] = § - § ■ Stab_ 1/3 (/), where the expecta- 
tion is taken with respect to the impartial culture assumption. 

Proof. Using the arithmetization NAE(a, b, c) = | — \ (ab + be + ac), we first note that 

E[NAE(/(x), f(y), f(z))] = | - \ ( B[f(x)f(y)] + B[f(y)f(z)] + B[f(x)f(z)]) 

= l-lE[f(x)f(y)], 
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where again, all expectations are taken with respect to the impartial culture assumption. 
Since E[rcji/j] = —1/3 if (xi,yi,Zi) is an NAE-triple, the quantity E[/(x)/(y)] is exactly 
Stab_!/ 3 (/) and the proof is complete. ■ 

Theorem 35 does indeed imply Arrow's impossibility theorem since 

3 _ 3 . Stab_ 1/3 (/) = |-| f>±) fc • W "(/) < I + I • wl (/)> 

k=0 

and so if E[NAE(/(x), f(y), f(z))] = 1 then W 1 (/) > 1. Furthermore note that the proba- 
bility of an irrational outcome is at least 1 — e then W 1 > 1 — 0(e). By a theorem of E. 
Friedgut, G. Kalai and A. Naor [FKN02], if W 1 (f)>l-e then / is 0(e)-close to iDICTj 
for some i £ [n]. Therefore Kalai's theorem is in fact a robust version of Arrow's impossi- 
bility theorem: if most rational voter preference profiles aggregate to a rational outcome, 
then the aggregating function must be close to a dictator or anti-dictator. 

We conclude by giving an upper bound on level- 1 Fourier weight of transitive-symmetric 
functions. By Theorem 35, this gives an upper bound on the probability that such functions 
aggregate rational voter preference profiles to a rational outcome. We will also prove a 
generalization of this fact (Proposition 42) using the Berry-Esseen theorem tomorrow. 

Proposition 36 (W 1 (/) of transitive-symmetric functions) Suppose f(i) = f(j) for 
alli,je[n}. ThenVf l (f)<l+o n (l). 

Proof. First note that £?=i f(if = n ■ /(l) 2 = n ■ (± £™ =lj /00) 2 = £(££=i f^f- The 

claim then follows since we have seen that Y£=l /(*) ^ EiU MA J(^) ~ v^Wtt (Proposition 
27). ■ 
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2 Noise stability and small set expansion 

Tuesday, 28th February 2012 



• Puzzle: Compute the Fourier expansion of MAJ„. Hint: consider T p DjMAJ(l, . . . , 1). 

Roughly speaking, the central limit theorem states that if X\ , . . . , X n are independent ran- 
dom variables where none of the Xj's are "too dominant", then S = Yl7=i is distributed 
like a Gaussian as n — > oo. As a warm-up, we begin by giving another proof of the fact 
that Inf(MAJ) ~ y / 2n/n, this time using the central limit theorem. First note that 



Inf(MAJ) = E MAJ(x) £ x, 



i=l 



E 



i=l 



V^-E 



i=i Vn 



Now by the central limit theorem we know that E*Li x i ~ * Q ~ -^(0, 1) as n — > oo, and 
since E[\Q\] = \/2/tt, we conclude that Inf(MAJ) ~ y/2/ir ■ 



Definition 37 (reasonable r.v.) Let B > 1. TFe say i/iai a random variable X is B- 
reasonable ifB[X 4 } < B ■ B[X 2 } 2 . Equivalently, \\X\\ 4 < B l ' A ■ \\X\\ 2 . 

For example, a uniformly random ±1 bit (i.e. a Rademacher random variable) is 1- 
reasonable, and a standard Gaussian is 3-reasonable. The Berry-Esseen theorem [Ber41, 
Ess42] is a Unitary version of the central limit theorem, giving explicit bounds on the rate 
at which reasonable random variables converge towards the Gaussian distribution. 

Theorem 38 (Berry-Esseen) Let X\, . . . ,X n be independent, B-reasonable random vari- 
ables satisfying E[Xj] = 0. Let af := E[X 2 ] and suppose Ya=i °f = Let S = X\ + . . .+X n 
and G ~ iV(0, 1). For all t G R, 



Pr[S < t] -Pr[G <t]\< 0(e) 



where e= (B • af) 1/2 < VB • max{|ai|}. 



We prove the Berry-Esseen theorem with a weaker bound of e = (B ■ Y^=i a f) X ^ xu Section 
4.2. 



2.1 Sheppard's formula and Stab p (MAJ) 

Definition 39 (p-correlated Gaussians) Let Q,Q' ~ iV(0, 1) be independent standard 

Gaussians. Set % = (Q,Q') ■ (p, \A ~~ P 2 ) := P ' G + a/1 — • C?'. T/ien and % are 

p-correlated Gaussians. Note that if Q and T~L are p-correlated Gaussians then ~E[QT~l] = 

p ■ B[g 2 } + yT7 • E[g\ E[g'\ = P . 

Theorem 40 ([She99]) LetQ andH. be p-correlated Gaussians. Then Pr[sgn(C?) ^ sgn(%)] = 
arccos(p)/7r. 
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Proof. First recall that sgn(-u • v) is determined by which side of the halfspace normal to u 
the vector v falls on. Since % = (p, \Jl — p 2 ) • (Q,Q'), and Q = (1, 0) ■ (Q, Q 1 ), it follows that 
Pr[sgn(£?) / sgn(%)] is precisely the probability that the halfspace normal to (G,G') splits 
the vectors (p,\Jl — p 2 ) and (1,0). Therefore, 

Pr[sgn(C/) / sgn(%)] = ^ • (angle between (p, \/l — p 2 ) and (1, 0)) = ^ arccos(p). 



Next, we use Sheppard's formula to prove that Stab p (MAJ) — > 1 — | arccos(p) as n -> oo. 
First recall that 



Stab p (MAJ) = E [MAJ(x)MAJ(y)] = 1 - 2 Pr[MAJ(x) ^ MAJ(y)], 

(x,y) p-corr 



and so it suffices to argue that Pr[MAJ(x) / MAJ(y)] — > ^ arccos(p). Next, we view 
MAJ(x) = sgn( Xl + '-_ + Xn ) , MAJ(y) = sgn(^ + ' ' ' + ^ 



n 



and note that 
E 



i=i v n/ m=i v n/ J L i=i i=i J ~{ 



P- 



While the standard central limit theorem tells us that X = (x\ + . . . ±x n ) / \fn and Y = (y\ + 
• • • + Vnj/y/n each individually converges towards the standard Gaussian Q ~ N(0, 1), the 
two-dimensional central limit theorem states that (X, Y) actually converge to p-correlated 
Gaussians (Q, %) as n — > oo. In fact, the two-dimensional Berry-Esseen theorem quantifies 
this rate of convergence, bounding the error by ±0(1/ \fn) as long as p is bounded away 
from ±1. Combining this with Sheppard's formula, we conclude that 



Pr[MAJ(x) / MAJ(y)] -> Pr[sgn(^) / sgn(H)] = iarccos(p) as n — > oo. 



Since 



Stab p (MAJ) -> 1 - I arccos(p) = f + f p 3 + . . . + ( (fe * ^ ^ • p fc + . . . 
and Stab p (MAJ) = ££=o p fc - W fc (MAJ), we have that W fc (MAJ) -> ( (fe ^} /2 ) ^ as n ^ oo. 



2.2 The noisy hypercube graph 

Let p £ [—1,1]. The p-noisy hypercube graph is a complete weighted graph on the vertex 
set {—1, l} ra , where the weight on an edge (x, y) is the probability of getting x and y when 
drawing p-correlated strings from { — 1, l} n . Equivalently, wt(x,y) = Pr[x «— U]Pr[y <h- 
N p (x)] = 2~ n ■ (\ - \p) A{x ' v) (\ + \p) n - Ai - x ' y \ and the sum of weights of all edges incident 
to any x G { — 1, l} n is exactly 2~ n . 
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Recall that if / is a boolean function, then Stab p (/) = E( a . y ) p _ corr [/(x)/(y)] = 1— 2Pr[/(x) / 
/(?/)], or equivalents, Pr (:Eiy ) p _ corr [/(x) ^ f(y)} = \- \- Stab p (/). Viewing / as the in- 
dicator of a subset Af C {—1, l} n , the quantity Pr[/(x) 7^ /(y)] is the sum of weights of 
edges going from Af to its complement Af. Therefore, roughly speaking, a function / is 
noise stable iff the sum of weights of edges contained within Af and Af is large. On Friday 
we will prove the Majority Is Stablest theorem due to E. Mossel, R. O'Donnell, and K. 
Olezkiewicz [MOO 10]: 



Majority Is Stablest. Fix a constant < p < 1, and let / : {— 1, l} n — > 
{— 1, 1} be a balanced function. It is easy to see that Stab p (/) is maximized 
when W 1 (/) = 1, in which case / = ±DICTj and Stab p (DICTj) = p. However, 
if we additionally require that Infj(/) < r for all i G [n], then the theorem 
states that Stab p (/) < Stab p (MAJ) + o T (l); i.e. the set A of density 1/2 with 
all influences small that maximizes sum of weights of edges within A and A is 
^4maj- 



Let A C {—1,1}™ and 1 A (x) : {— l,l} n —> {0,1} be its indicator function, and let a := 
Efl^O)] = \A\ ■ 2~ n denote its density. Recall that Stab p (l^) = E[l A (x)l A (y)] = Pr[x G 
A & y G A], or equivalently, Pr[y G A \ x G A] = - ■ Stabp(l^), and so Stab p (l J 4) is 
the probability that a random walk (where each coordinate is independently flipped with 
probability (| — \p)) starting at a point x G A remains in A, normalized by the density of 
A. Tomorrow we will prove (a specific instance of) the small set expansion theorem: 



Small Set Expansion. Stab p (l^) < a 2 ^ 1+p \ or equivalently, Pr[y G A \ x G 
A] < a^-py^+d. In particular, if a is small, the probability that a random 
walk starting in A landing outside A is very high. 



Recall that Stab p ( I4 ) = W° ( I4 ) + p ■ W 1 ( 1 A ) + p 2 ■ W 2 ( I4 ) + ..., and so j- p Stab p ( \ A ) | p=0 = 
W 1 (1 J 4). As a direct corollary of the small set expansion theorem we have the following 
bound on W 1 (l^): 

W^Ia) = ^Stab p (l A ) < ^« 2/(1+p) = 2a 2 ln(l/a). 



p=0 



p=0 



We now give a self-contained proof this fact, due to Talagrand [Tal96]: 

Theorem 41 (level-1 inequality) Let f : {-1, l} n {0,1} and a = E[/]. Then 
Wi(/) = 0(a 2 ln(l/a)). 

Proof. First consider an arbitrary linear form t(x) = Yli=i a i x i normalized to satisfy a f = 
1. For any t$ > 1, we partition x G { — 1, 1}™ according to whether \£(x)\ < to o r 1^(^)1 — to, 
and note that 

B[f(x)£(x)] = E [l(\£(x)\ < t ) ■ f(x)£(x)] + E [l(|^(x)| > t ) ■ f(x)£(x)] . 
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The first summand is at most a ■ to, and the second is at most 

2e-* 2 /2 dt < 2 I" te~ t2 ' 2 dt = \-2e~ t2 / 2 } °° = 2 • e"^/ 2 

to 



to J to 



by Hoeffding, where the inequality holds since to > 1. Choosing to = (2 ln(l/a)) 1//2 > 1, we 
get 

V[f(x)e(x)} < 0(a^HlM). (5) 



Now let £{x) = - Y^=i f(i) x i where a = yWi(7) (if <r = we are done). Note that 

n n 

E[f(x)£(x)} = £/(i)/(0 = J £/(<) a = 
The claimed inequality then follows by applying (5). ■ 

Proposition 42 (| theorem) Let / : { — 1,1}™ — > { — 1,1} with \f(i)\ < e for all i G [nj. 
Then W x (/) < l + 0{e). 

Proof. Let a = y/W 1 (f) and assume without loss of generality that a > 1/2 (otherwise 

W 1 (f) < 1/4 < | and we are done). Let £(x) = ± Yh=i /C*) 3 * where 1^(01 ^ 2e for a11 
i G [n]. Note that E[/(x)£(x)] = cr and E[£(x)f(x)] < E[\£(x)\]. Applying Berry-Esseen 
gives us E[|^(x)|] ~o(e) = a/2/tt and this completes the proof (technically Berry- 

Esseen only yields a bound on closeness in cdf-distance, but this can be translated into a 
bound on closeness in first moments). ■ 

A dual to Proposition 42 holds as well: if W 1 (/) > — — e, then / is 0(y / e)-close to a 
linear threshold function (in fact the LTF is simply sgn( Yl7=i f{^) x i))- This is the crux 
of the result that the class of linear threshold functions is testable with poly(l/e)-queries 
[MORS 10]. 

2.3 Bonami's lemma 

The next theorem, due to Bonami [Bon70], states that low degree multilinear polynomi- 
als of Rademachers are reasonable random variables (this is sometimes known as (4, 2)- 
hypercontractivity) . 



Theorem 43 (Bonami) Let f : { — 1,1}™ — > R be a multilinear polynomial of degree at 
most d. Then ||/|| 4 < V^Wfh- Equivalent^, E[f(x) 4 ] < 9 d • E[/(x) 2 ] 2 



Proof. We proceed by induction on n. If n = then / is the constant and the inequality 
holds trivially for all d. For the inductive step, let 

f(xi, ...,x n )= g(x x , x n -i) + x n h(x 1 , x n -i) 
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Notice that g has degree at most d, h has degree at most d — 1, and both are polynomials 
in n — 1 variables. Notice also that the random variable x n is independent of both g and h. 
Therefore, we have: 

E[/ 4 ] = E[(g + x n h) 4 } 

= E[g 4 } + 3E[x n ] E[g 3 h] + 6E[a£] E[g 2 h 2 } + 3E[a£] E[gh 3 } + E[x 4 ] E[h A ] 

where we used independence for the second equality. Now note that E[x n ] = E[x^] = 0, 
E[xl] = E[x 4 ] = 1, and E[g 2 h 2 } < ^/E^l^/E^] by Cauchy-Schwarz. Therefore, 

E[/ 4 ] < Et/l+ev ^Iy/EM + Et/i 4 ] 

< 9 d E[g 2 } 2 + 6^9 d E[g 2 ] 2 ^ d - 1 E[/i 2 ] 2 + 9^ E[h 2 } 2 
= 9 d -{E[g 2 ] 2 + 2E[g 2 ]E[h 2 ] + lE[h 2 ] 2 ) 

< 9 d -(E[<? 2 ]+E[/i 2 ]) 2 

To complete the proof, notice that: 

E[/ 2 ] = E[(g + x n h) 2 } 

= E[g 2 } + 2E[x n ]E[gh} + E[x 2 n ]E[h 2 } 
= E[g 2 ]+E[h 2 } 

and so we have shown that E[/ 4 ] < 9 d ■ E[f 2 } 2 . U 
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3 KKL and quasirandomness 

Wednesday, 29th February 2012 

• Open Problem: Prove that among all functions / : {—1, l} n —> { — 1, 1} with deg(/) < 
d, the quantity Y2?=i /(*) ^ s maximized by MAJ^. Less ambitiously, show Y17=l /00 = 
O(Vdfef)). 

3.1 Small set expansion 

We begin by proving the p = 1/3 case of the small set expansion theorem: let ^4 C {—1, l} n 
be a set of density a = \A\ ■ 2~ n , and 1a '■ {—1, l} ra — ► {0, 1} be its indicator function. We 
will need the following variant of Bonami's lemma; its proof is identical to that of Theorem 
43. 

Theorem 44 (Bonami') Let f : {-l,l} n -> R. Then ||T 1/v ^/|| 4 < ||/|| 2 - 

Theorem 44 is a special case of the hypercontractivity inequality [Bon70, Gro75, Bec75]: if 
l<p<q<oo and p< ^/fif, then \\T p f\\ q < \\f\\ p . 

Theorem 45 (SSE for p = 1/3) Let A C {-l,l} n . T/ien Stab 1/3 (l j4 ) < a 3 / 2 , w/iere a 
is i/ie density of A. 

Proof. We will need a corollary of Theorem 44 that will also be useful for us when proving 
the KKL theorem in the next section: HTi^/lH — Wflll/s- To see that this holds, we check 
that 

IIT^/Ill = E[/(x)(T 1/3 /)(x)] 

< ll/IU/s-ri/s/IU (6) 

= ll/IU/s-ll^^/^iu 

< ll/IU/3 ■ l|r 1/Vg /||2. (7) 



Here (6) is by Holder's inequality and (7) by applying Theorem 44 to Ty^f; dividing both 
sides by H^i/^/lh yields the claim. Applying this corollary to / = 1a completes the proof: 

Stab 1/3 (l4) = £ (lf l f(Sf = \\T 1/VS l A \\l < E [l A (x) i/3 ] V2 = « 3/2 - 
SC[n] 

Here the second equality is an application of Parseval's, and the final uses the fact that 1a 
is {0, l}-valued. ■ 
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3.2 Kahn-Kalai-Linial 

Theorem 46 ([KKL98]) Let f : {-1,1}" -»■ {-1,1} with E[/] = 0, and set a := 
max ie[n] {lnf i(f)}. Then Inf(/) = n(log(l/a)). 

Proof. Recall that variable i is pivotal for a; in / iff (Dif)(x) = dbl, and so E[|A/|] = 
Infj(/) < a; the plan is to apply the small set expansion theorem to A/ and sum over all 
i 6 [n]. We have shown in the proof of Theorem 45 that ll^i/^^fflli — 1 1 ^ 1 1 2/3 ' anc ^ a PPly m g 
it to g = Dif gives us 

Stab 1/3 (A/) < HA/lll/3 = E [|A/| 4/3 ] 3/2 = Inf 4 (/) 3/2 . (8) 
We sum both sides of the inequality over i £ [n], starting with the left hand side. Recall 
that Stab 1/3 (A/) = Esc[n) ilf l DTf(S) 2 = (|) |5hl /(S) 2 , and so: 

n n 

£stab 1/3 (A/) = EEQ) |Sh W 

i=l i=l Ssi 

= E i^KI) |shl /» 2 

|S|>1 

> E |5i(i) |shl /W 

l<|5|<2Inf(/) 

> £ 2Inf(/).(I) 2Inf(/) - 1 /(S) 2 (9) 

l<|5|<2Inf(/) 

> 3Inf(/)-(i) Inf(/) . (10) 

Here (9) uses the fact that decreasing function when x > 1, and (10) uses 

Markov's inequality X]|s|>2inf(/) f(S) 2 — | along with the assumption that / is balanced. 
Now summing the right hand side of (8) gives us 

E"=i Inf i(/) 3/2 < « • Inf (/)- Combining 
both inequalities yields 3 • (g) Inf ^ < a 1 / 2 , and therefore Inf(/) = Sl(log(l/a)) as claimed. 



Corollary 47 Let f : {-l,l} n -> {-1,1} tfliifc E[/] = 0. T/ien max ie[n] {Infi(/)} = 



This bound on the maximum influence is tight for the Ben-Or Linial TRIBES function 
[BL89]: the 2 fc -way OR of k-w&y AND's of disjoint sets of variables (so n = k ■ 2 k ). We 
remark that while Corollary 47 only gives a log(ra) improvement over the 1/n bound that 
follows directly from the Poincare inequality, this factor makes a crucial difference in many 
applications (e.g. it is the crux of Khot and Vishnoi's [KV05] counter-example to the 
Goemans-Linial conjecture [Goe97, Lin02]). 

Corollary 48 Let f : { — 1,1}™ — > { — 1,1} be a balanced, monotone function viewed as a 
voting scheme. Both candidates can bias the outcome of the election in their favor to 99% 
probability by bribing a O ( lo \ \ ) fraction of voters. 
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3.3 Dictator versus Quasirandom tests 

Definition 49 (noisy influence) Let f : { — 1, l} n — > R and p G [0,1]. T/ie i-£/i p-noisy 
influence off is Inf^(/) := Stab p (A/) = £ S3i (S) 2 . 

Note that Inf^ 1 ^/) = Infj(/) and Inf-°^(/) = f(i) 2 , and intermediate values of p G (0,1) 
interpolates between these two extremes - the larger the value of p is, the more the weight 
f(S) 2 on larger sets S is dampened by the attenuating factor p^^ 1 . 

Definition 50 (quasirandom) Let f : { — 1,1}" — >■ M and e,5 £ [0,1]. M^e say i/iai / is 
(e, 5) -quasirandom, or f has (e, 5) -small noisy influences, i/Inf. 1 (/) < e for all i € [n]. 

A few prototypical quasirandom functions are the constants ±1 (these are (0, 0)-quasirandom), 
the majority function ((0(-^=), 0)-quasirandom) , and large parities xs ((1 — 5)' 5 ''~ 1 ,0)- 
quasirandom). Unbiased juntas, and dictators in particular, are prototypical examples of 
functions far from quasirandom. The next proposition states that even functions far from 
being quasirandom can only have a small number of variables with large noisy influence: 

Proposition 51 Let f : {-l,l} n -> R with Var(/) < 1, and let J = {i G [n] : Inff _<5) (/) > 
e} be the set of coordinates with large noisy influences. Then \ J\ < \jeb. 

Proof. We first note that 

n 

e-|J|<E Inf l M (/)<E Inf ^ ) (/)= E 1^1 • C 1 - *) |Shl /(S) 2 - 

iSJ i=l |5|>1 

It remains to check that 15*1 • (1 — <5)' 5 ' -1 < 1/5 for any S" C [n]: to see this holds, note 
that (1 — (J)! 5 ^ 1 < (1 — (5)* _1 for any i < \S\, and so summing over i from 1 to [SI gives 

us \S\ ■ (1 - S)^- 1 < EHi(1 - <^ _1 < E 4 =i(l " = We have shown that 

£ • \ J\ < (1/5) • Var(/) < 1/5, and the proof is complete. ■ 

Consider the problem of testing dictators: given blackbox access to a boolean function /, if / 
is a dictator the test accepts with probability 1, and if / is e-far from any of the n dictators it 
accepts with probability 1 — O(e). Implicit in Kalai's proof of Arrow's impossibility theorem 
(Theorem 35) is a 3-query test that comes close to achieving this: 



3-query NAE test 

1. For each i G [n], pick (xi,yi,Zi) uniformly from the 6 possible NAE 
triples. 

2. Query / on x, y, and z. 

3. Accept iff NAE(f(x),f(y),f(z)) = 1. 
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Recall that the NAE test accepts / with probability | — | ■ Stab_!/ 3 (/), and so if / = DICTj 
for some i G [n] (in particular, Stab_ 1/ / 3 (/) = —1/3) the test accepts with probability 1 (i.e. 
we have perfect completeness). However, the NAE test does not quite satisfy the soundness 
criterion. We saw that if the test accepts / with probability 1 — e then W 1 (/) > 1 — 0(s), 
and by a theorem of Friedgut, Kalai and Naor, / has to 0(e)-close to a dictator or an 
anti-dictator; the soundness criterion requires / to be 0(e)-close to a dictator. It therefore 
remains to rule out functions that are close to anti-dictators, and we will do this using the 
Blum-Luby-Rubinfeld linearity test. Recall that the BLR test is a 3-query test that accepts 
/ with probability 1 if / = xs for some SC [n], and with probability 1 — if / is e-far 
from all the parity functions. Combining the BLR and NAE tests, we have a 6-query test 
for dictatorship with perfect completeness and soundness 1 — Cl(e). In fact it is easy to 
show that we can perform just one of the two tests, each with probability ^, and reduce the 
query complexity to 3 while only incurring a constant factor in the rejection probability. 

As we will see on Saturday, for applications to hardness of approximation (UGC-hardness in 
particular) it suffices to design a test that distinguishes dictators from (e, e)-quasirandom 
functions, instead of one that distinguishes dictators from functions e-far from dictators. 

Definition 52 (DICT vs QRAND) Let < s < c < 1. A (c,s) dictator versus quasiran- 
dom test is defined as follows. Given blackbox access to a function f : { — 1, l} n — > {—1, 1}, 

1. The test makes 0(1) non-adaptive queries to f . 

2. If f is a dictator, it accepts with probability at least c. 

3. If f is (e,e)-quasirandom, it accepts with probability at most s + o £ (l). 

As we will see, often we will need to assume that / is odd (i.e. f(—x) = —f(x) for all 
x £ { — 1, l} n , or equivalently, f(S) = for all even \S\). Let us consider the NAE test as a 
dictator versus quasirandom test, under the promise that / is odd. We have seen that the 
test has perfect completeness (i.e. c = 1), and now we determine the value of s. First note 
that since / is odd, 

Pr[NAE accepts /] = § - § ■ (W°(/) - I W 1 ^) + | W 2 (/) - ^W 3 (/) + . ..) 

= !"!•(- 3WV) - ^W 3 (/) - ^W 5 (/) - . . . ) 

= ! + !-Qw 1 (/) + ^w 3 (/) + ^w 5 (/) + ...) 

= f + f • Stab 1/3 (/). 

Now let / be a (e, e)-quasirandom function. Applying the Majority Is Stablest theorem (we 
will need a statement of it for functions with e-small e-noisy influences instead of e-small 
regular influences), we have 

Pr[NAE accepts /] = | + | ■ Stab 1/3 (/) 

< | + |-Stab 1/3 (MAJ) + o e (l) 

-> | + |-(l-faiccos(i))+o e (l) (11) 

= 0.91226... + o e (l), 
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where (11) uses the estimate we proved for Stab^MAJ) using Sheppard's formula and the 
central limit theorem in Section 2.1. Therefore we have shown that the NAE test is a 
(1,0.91226. . .) dictator versus quasirandom test. 

We consider two more examples of dictator versus quasirandom tests and compute their c 
and s values: the p-noise test of S. Khot, G. Kindler, E. Mossel and R. O'Donnell [KKMO07], 
and J. Hastad's 3XOR.5 test [HasOl]. 



KKMO 2-query p-noise test 

1. Let p S [0, 1]. Pick x £ {—1, l} n uniformly, and y ~ N p (x). 

2. Query / on x and y. 

3. Accept iff f(x) = f(y). 



First note the p- noise test accepts / = DICTj with probability | + ^p, the probability that 
X{ is not flipped in y. For soundness, let / be an odd (e, e)-quasirandom function and note 
that 

Pr[KKMO accepts /] = E [| + \f{x)f{y)} 

= i + i-Stab p (/) 
< \ + | • Stab p (MAJ) + o e (l) 
-> \ + \ ■ (1 - f arccos(p)) +o e (l), 

where once again we have used the Majority Is Stablest theorem along with Sheppard's 
formula (the assumption that / is odd is used in the application of the Majority Is Stablest 
theorem, which requires need E[/] = 0). Different values of p result in different c versus s 
ratios; for example, if p = l/v2 then c = 0.85 and s = 0.75. 

For Hastad's test it will be convenient for us to view our functions as / : F2 — > {—1,1}. 
Like in the analyses of the NAE and p-noise tests, we will have to assume that / is odd. 



Hastad's 3-query 3X0 R,5 test 

1. Let 5 £ [0,1]. Pick x,y £ uniformly and independently, and set 
z = x + y. 

2. Pick x' ~ Ntsix). 

3. Query / on x', y, and z. 

4. Accept iff f(x')f(y)f(z) = 1. 



Note that this is identical to the BLR linearity test, except with the noisy x 1 instead of x. 
Once again it is easy to see that dictators pass with probability | + i(l — <5) = 1 — |, and 
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it remains to analyze soundness: 

Pr[3XOR 5 accepts /] = E [\ + \f(x')f(y)f{z)] 

x,y,x' 

= ± + ±E \E[f(x')]f(y)f(z) 

x,y t' 



= \ + \ E [(Tis f)(x) f (y) f (< 



h + \ E 



(T^ s f)(x)nf(y)f(x + y)} 



= i + iE[(T w /)(x)/T/(x)] 

= 3 + 3 E = a + 3 E (! - *) |5| /(s) 3 - 

SC[n] SC[n] 

Note that E5c[n]( 1 ~ 5) |S| /(S) 3 < max 5 c [n] {(l - <5)I 5 I \f(S)\} by Parseval's. Next we claim 
that for all e < S, if / is (e, e)-quasirandom then (1 — 5)\ s \ \f(S)\ < s/e for all S with odd 
cardinality (in particular, S / 0). To see this, assume for the sake of contradiction that 
there exist an S of odd cardinality for which this inequality does not hold. Then y/e < 
(1 - <5)I 5 I|/(5)| < (1 - e)l s l|/(S)|, and squaring both sides gives us e < (1 - e) 2 \ s \f(S) 2 < 

(1 - e) |s| - 1 /(5') 2 < Inf| 1_e) (/) for a11 i G S. Since 5^0, this contradicts the assumption 
that / is (e, e)-quasirandom. 

Since the 3XOR5 test accepts (e, e)-quasirandom functions with probability at most ^ + \\fi 
(i.e. s = we have shown that it is a (1 — |, 5) dictator versus quasirandom test. 
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4 CSPs and hardness of approximation 



Thursday, 1st March 2012 
4.1 Constraint satisfaction problems 

We begin by noting that function testers can be viewed more generally as string testers: 
the tester is given blackbox access to a string w 6 {—1,1}^ (i.e. the truth-table of /, 
so N = 2 n ), and if w satisfies some property P\ C {—1, 1}^ (e.g. dictatorship) the tester 
accepts with probability say at least |, and if w satisfies some other property P2 Q {—1, 1}^ 
(e.g. quasirandomness, far from dictatorship, etc.) it rejects with probability at least |. 

We may view (non-adaptive) string testers simply as a list of instructions. For example, 

with probability p\ query w\ , W5 , wio and accept iff ^(wi, W5, w 10) 
with probability P2 query 1017,104,1x13 and accept iff <fi$(wn, 104, W3) 
with probability p% query W2, w±2, W7 and accept iff 04(^2,^12,^7) 

Here (pi , 4>2 , • • • are predicates {— 1,1}* —> {T, F}. From this point-of-view, we see that 
a string tester naturally defines a weighted constraint satisfaction problem (CSP) over a 
domain of N boolean variables, with the predicates fa's as constraints and the associated 
Pi's as weights. The question of determining which string w £ {—1, 1}^ passes the test with 
highest probability is then equivalent to the question of finding an optimal assignment that 
satisfies the largest weighted fraction of predicates. 

Under this correspondence, an explicit (c, s) dictator versus quasirandom test for functions 
/ : {—1, 1} — > { — 1, 1} defines an explicit instance of a weighted CSP over L = 2 e boolean 
variables. Since all t dictators pass with probability at least c, there are I special assignments 
each of which satisfy at least a c weighted fraction of constraints. Furthermore, since all 
quasirandom functions pass the test with probability at most s + o(l), any CSP assignment 
which is, roughly speaking, "very unlike" the I special assignments will satisfy at most a 
s + o(l) fraction of constraints. In other words, any CSP assignment that satisfies at least 
an s+Q(l) fraction of constraints must be at least "slightly suggestive" of at least one of the 
£ special assignments (we say that / is suggestive of the i-th coordinate if Inf^ 1 (/) > e, 
and in particular, an (e, e)-quasirandom function suggests none of its coordinates). 

On Saturday Per will prove the following theorem establishing a formal connection between 
dictator versus quasirandom tests, the Unique-Label-Cover problem, and the hardness 
of approximating certain CSPs [Kho02, KR03, KKMO07, Aus08]: 

Theorem 53 Suppose there is an explicit (c, s) dictator versus quasirandom test that uses 
predicates (pi, . . . , 4> r . For every e > there exists a polynomial-time reduction where: 
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Unique-Label- Cover — > CSP with constraints <j>\, . . , , <j) r 

YES instance — > there exists an assignment that satisfies 

a (c — e) fraction of constraints 

NO instance — > every assignment satisfies at most 

an (s + e) fraction of constraints 

The Unique Games Conjecture [Kho02] asserts that approximating the Unique-Label- 
Cover problem is NP-hard. Theorem 53 therefore says that assuming the UGC, for any 
constant e > an explicit (c,s) dictator versus quasirandom test implies the NP-hardness 
of ((s/c) + e)-factor approximating CSPs with constraints corresponding to the predicates 
used by the test. 

Recall that our analyses of all three dictator versus quasirandom tests we have seen so 
far depend on the promise that / is odd (in particular, note that the (0, 0)-quasirandom 
constant function / = 1 pass both the KKMO p- noise test and Hastad's 3X0 R$ test with 
probability 1). One way to elide this assumption is to view them as testers for general 
functions g : {— 1, l}^ 1 — > {—1,1} (corresponding to half of the truth table of an odd 
function / : {—1, 1} -4 {—1, 1}, say for the inputs with xi = 1) where the respective 
predicates allow literals instead of just variables. Although the string w = 1 N (i.e. f = 1) 
trivially satisfies any CSP with constraints of the form u>, = Wj (i.e. the KKMO p-noise 
test) or WiWjWk = 1 (i.e. the Hastad 3XOR,5 test), the same is no longer true if literals are 
allowed in the constraints. Given this, we may apply Theorem 53 to the NAE test, KKMO 
p-noise test, and Hastad's 3XOR<5 test to conclude that under the UGC, 

• Approximating MAX-3NAE-SAT to a factor of 0.91226 . . . + e is NP-hard. 

• Approximating MAX-2LIN to a factor of (1 — - arccos(/o))/(^ + \p) + e is NP-hard. 

• Approximating MAX-3LIN to a factor of (^ + e) is NP-hard. 

4.2 Berry-Esseen 

In this section we prove the Berry-Esseen theorem [Bcr41, Ess42], a Unitary version of the 
central limit theorem with explicit error bounds. Actually we will give a proof that only 
yields a polynomially weaker error bound, the upshot being that the proof is relatively 
simple and can be easily generalized to other settings (as we will see tomorrow, the Mossel- 
O'Donnell-Olezkiewicz proof of the invariance principle, an extension of the Berry-Esseen 
theorem to low-degree polynomials, is very similar in spirit). We will need Taylor's theorem: 

Lemma 54 (Taylor) Let ip be a smooth function and r £ N. For all x G R and e > 

there exists ay 6 [x, x + e] such that 

j,(x + £ ) = ^(x) + e^ l \x) + • e 2 ^ 2 \x) + . . . + ^ • e^^^x) + J • e r ^ r \y). 

In particular, ip(x + e) = ip(x) + e ■ V'*' 1 '' ( x ) + \ • e 2 ^ 2 ^ (x) + | • e 3 ip^ (x) + error, where the 
error term has magnitude at most HV^Hoo • e 4 /24. 
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Proposition 55 (hybrid argument) Let X\, . . . ,X n be independent random variables 
satisfying E[JQ] = 0. Let of := E[X 2 ] and suppose Ei=i °f = ^- Let X = Yl7=l^> 
Q ~ N(0, 1) and ip : R -> R. T/ien 



E[V»(X)]-E[^)]|=0 ||^|UEE[X 



rt 



1=1 



Afofe i/iai «/ eac/i X{ is B-reasonable then Y^i=i E[^ 4 ] < -B • Ei=i a f<B- max {of}. 

Proof. We will view as the sum of independent Gaussians Gi + ■ ■ ■ + Gni where each 
Gi ~ N(0,af). The proof proceeds by a hybrid argument, showing that only a small error 
is introduced whenever each in X is replaced by the corresponding Gaussian. More 
precisely, for each i = 0, . . . , n, we define the random variable Zj := Gi + • • • + Gi + -Xj+i + 
. . . + X n ; these n + 1 random variables interpolate between Zo = X and Z n = G- We will 
prove the inequality 

| EMZi-l)} - EhKZ0]| = 0(||V (4) ||oo • ELY 4 ]). 

for all z € [n], noting that this implies the theorem by the triangle inequality. Fix i £ [n] 
and define the random variable R := Gi + • • • + Gi-i + + • • • + X n , so Zj_i = R + 
and Zj = R + ft. Our goal is therefore to bound | E[^(R + ai ■ Xi)] — E[^(R + Oi ■ ft)]|. 
Applying Taylor's theorem twice, we get 

| E[^(Z f _i)] - E[^(Zi)]| = | E ty(R) + Xi • V (1) (R) + \xi • V> (2) (R) + ±X? • ^(R) + error: 

- E ty(R) + ft • (R) + |Sf • V> (2) (R) + H^- • ^ (3) (R) + error 2 
= | Eferrori — error2]|. 

Here we have used that fact that R is independent of X\ and Gi, along with the assumption 
that Xi and Gi have matching first and second moments. Substituting bounds on the error 
terms errori and error2 given by Taylor's theorem, we complete the proof: 



E[errori — erro^]) < E 



WU-xf , ||V (4) 



24 24 



Odl^lU-E^ 4 ]). 



The same proof can be rewritten to show that if Y = Y\ + . . . +Y n is the sum of independent 
random variables satisfying EpQ] = E|Y<], E[Xf] = E[Xf], and B[Xf] = E[Xf] (the 
matching moments property), then | E[^(X) - i(Y)]\ < ||^ (4) ||oo • E"=i ^\ x t\ + E K 4 ]- 

Let iftt '■ R — > R be the threshold function that takes value 1 if x < t, and otherwise. 
Of course the 4 th derivative of this function is not uniformly bounded, but note that if it 
were the Berry-Esseen theorem would follow as an immediate corollary of Proposition 55. 
Instead, we will use the fact that ipt is well-approximated by a function which does have 
a uniformly bounded 4 th derivative, which then implies a slightly weaker version of the 
Berry-Esseen theorem. 
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Lemma 56 (smooth approximators of thresholds) Let t G M and < A < 1. There 

exists a function Tp t ^x '■ K — s- R H^^Hoo = 0(1/A 4 ) i/iai approximates ifit i n the following 
sense: 

1- A,\( x ) = ^t{x) = 1 if x <t - A. 
2. ^,a(x) e [0,1] i/x G [t — A,t + A]. 
5- = ik(x) = 0ifx>t + X. 

We are now ready to prove a weak version of the Berry-Esseen theorem. 

Proposition 57 (weak Berry-Esseen) LetX\, . . . ,X n be independent, B -reasonable ran- 
dom variables satisfying E[JQ] = 0. Let af := E[X?], r := max{o~?}, and suppose 
Y^ =1 af = 1. Let S = X 1 + . . . + X n and G ~ iV(0, 1). For aZZ t G M, 

| Pr[5 < t] - Pr[g < t}\ < 0{(Bt) 1 / 5 ). 

Proof. Since Tp t +\ : \(x) = 1 for all x < t we have Pr[5 < t] < E[ip t+ \ : \(S)]. Now using the 
fact that ||^t+A aII°° = ^(1M 4 )> we a PPly Proposition 55 to get 

E[^+a,a(^)] = E[ip t +x,xm ± 0(^t/A 4 ) 

Since ipt + x \ { x ) is a t most 1 for all x < i + 2A and otherwise, we have 

E[V> <+ a,a(<?)] < Pr[<? < t + 2A] = Pr[S < t] + O(A), 

and so combining both error bounds gives us E,[ip t+ x,x(S)] < Pr[<5 < t] + 0(Bt/ A 4 ) + 0(A). 
Arguing symmetrically for ipt-\ t \(x) gives us | PrfS 1 < t] — Pr[Q < t]\ = 0(Bt/X 4 ) + O(A), 
and taking A = (Br) 1 / 5 yields the claim. ■ 
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5 Majority Is Stablest 



Friday, 2nd March 2012 



Our definition of p-correlated Gaussians (Definition 39) extend naturally to higher dimen- 
sions: let Q and Q' be independent standard n-dimensional Gaussians (i.e. Q = {Q\, . . . , Q n ) 
where each Qi ~ iV(0, 1) is an independent standard Gaussian, and similarly for Q'). Then 
Q and H := p • Q + ^1 - p 2 ■ Q< are p-correlated Gaussians. Just like in the one-dimension 
case, we have Ei[QiHi] = P for all i G [n]. 

Definition 58 (Gaussian noise stability) Let f : R n — > R and p G [—1,1]. The Gaus- 
sian noise stability of f at noise rate p is 

GStabp(/) := E[/ (£/)/(%)], where G,H are p-correlated Gaussians. 

We begin by showing that GStab p (/) = Stab p (/) for multilinear polynomials / : R n — > R. 
To see this, let f(x) = J2sc\n] c sY\ies x i ano - no ^ e that 

E 



SC\n] ieS ' x TC\n] i£T 



£ csUGiji £ ctUhA = £ c s c T B Jl Gill -Hi = £>|E Jl^ 



S,TC[n] ie5 ieT 5C[n 



L ie5 



where we have used the independence of Qi from Qj,T~Lj for j ^ i, along with the fact that 
E[%j] = E[Qi] = 0. Now again by independence and the fact that EfC/jHj] = p we conclude 
that GStab p (/) = EseM Z 9 ' 5 ' !' which 

agrees with the formula for Stab p (/) we derived in 

Proposition 31. 



5.1 Borell's isoperimetric inequality 

Theorem 59 ([Bor85]) Let f : R n {-1,1} mtt E[/(G)] = 0, where Q is a standard 
n-dimensional Gaussian. Let < p < 1. T/ien GStab p (/ ) < 1 — - arccos(p). 

In this section we present Kindler and O'Donnell's recent simple proof of (a special case 
of) Borell's theorem [K012]. We first introduce a few definitions and give a geometric 
interpretation of the theorem as an isoperimetric inequality in multidimensional Gaussian 
space. 

Definition 60 (rotation sensitivity) Let f : R n — > {—1,1} and 5 G [0,7r]. The rotation 
sensitivity of f at 5 is defined to be RS/(<5) := Pr[f(Q) ^ f(7i)], where Q and H are 
cos(<5) -correlated Gaussians. 

Recall that Q and T~L are cos((5)-correlated if T~L = cos(<5) • Q + sin(<5) • Q' where Q 1 is a 
standard n-dimensional Gaussian independent of Q; we typically we think of 5 as small, so 
cos(<5) ~ 1 — 7}5 2 is close to 1 and sin(5) ~ 5 is a small quantity. If we view / : 1" — > {—1, 1} 
as the indicator of a subset 1/ of R n , the quantity RS/(5) measures the probability that 
the set 1 / separates a random Gaussian vector Q from a noisy copy of it (roughly speaking, 



30 



Q with sin(<5) • Q' of noise added). Therefore we may think of the rotation sensitivity 
of / as a measure of the boundary size of It. Indeed, Kindler and O'Donnell show that 
limsup^Q+ RS/ (5)/ 5 is within a constant factor of the traditional definition of the Guassian 
surface area of If for "sufficiently nice" sets If. 

Since RS/(<5) = Pr[/(£7) / f(Ti)} = \ — \~ GStab cos (,5)(/), Borell's theorem can be equiv- 
alently stated as RS/(5) > - for functions / with E[/(C/)] = 0; it gives a lower bound on 
the boundary size of sets If with Gaussian volume \. Sheppard's formula tells us that if 
/ = sgn(aixi + . . . + a n x n ) then Pr[f(Q) / f(H)] = ^ arccos(cos(<5)) = |, and so Borell's 
inequality is tight when 1/ is any halfspace defined by a hyperplane passing through the 
origin. 

Kindler and O'Donnell prove Borell's theorem for 5 = ^ where I E N (we may assume 
I > 2 since the inequality is trivially true for uncorrelated Gaussians). As a warm-up, we 
consider the case of 5 = \ (i.e. £ = 2). Let / : M. n {-1, 1} with B[f(G)} = and note 
that our goal is to prove RSy(^) > Let Q and Q' be independent standard n-dimensional 
Gaussians and set % = '^ + "^75 'Q' - Note that (0,1-1) and (Q 1 ,%) are both (-^-correlated 
Gaussians, and so we have 

Pr[/(0) + f(H)} + Pv[f(H) + /(<?')] = 2 • RS/(f ). 

By a union bound, the quantity on the left hand side is at most Pr[/(C?) / f(G')], and 
since / is balanced this probability is exactly ^ and the proof is complete. We remark 
that an identical proof can be carried out for bounded functions / : M. n — > [—1,1], with 
RSy(<5) := E[i — \f(G)f(H)} as the generalized definition of rotation sensitivity; all the 
steps remain the same, except that the inequality — |ac) < — ^ab) + (| — }-bc) for all 
a,b,c 6 [—1,1] will be used in place of the union bound. 

Theorem 61 ([K012]) Let / : R n -> {-1,1} with E[/((y)] = 0, where Q is a standard 
n-dimensional Gaussian. Let 5 = |j for some t G N, I > 2. Then RSf(5) > A. 

Proof. Let Q and Q' be independent standard n-dimensional Gaussians. For j = 0, . . . , £, 
we define the hybrid random variable Qv) ■= cos(j5) ■ Q + sm(j5) ■ Q' , and note that they 
interpolate between = Q and = Q' . Next, we claim that Q^" 1 ' and are cos(5)- 
correlated for every j £ [£]. Since the n coordinates are independent, it suffices to consider 
the first coordinates of ^ and Q^' and show that they are cos(5)-correlated. We write 
g(i-V for £p _1) = cos((j-l)5)-g+sw((j-l)5)-g' and for g{ j) = cos(j5)-g+sm(j5)-g' , 
and expand the expectation of their product to check that 

E[gU-VgW] = cos ((j -i)5)cos(j5)E[gg] 

+ cos((j -l)5)sm(j5)B[gg ; ] 

+ sm((j-l)5) cos(jd)E[g'g] 

+ sm((j-l)5) sinO^E^'] 
= cos((j — 1)5) cos(j5) + sin((j — 1)5) sm(jd) 
= cos((j — 1)5 — j5) = cos(<5). 
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Here we have used the trigonometric identity cos(9)cos(6') + sin(0) sin(#') = cos(# — 6'). 
Since ^U -1 ) and are cos(<5)-correlated, we apply the union bound to get \ = Pr[/(£) ^ 
/(£')] < ELi Pr[/(^ _1) ) / /(£ W )] = * • RS/(£), and the proof is complete. ■ 

5.2 Proof outline of MIST 

In this section we sketch the proof of the Majority Is Stablest theorem (MIST): 

Theorem 62 ([KKMO07, MOO10]) Let f : {-1, l} n -»• {-1, 1} and e,p > 0. Suppose 
E[f] = and InU(f) < e for all i € [n]. T/iera Stab p (/) < 1 - | arccos(p) + o e (l). 

Step 1. First consider T]_ 7 / for some small 7 > 0. Note that 

(a) Infi(Ti_ 7 /) < Infi(/) < e for all i e [n]. 

(b) Ti_ 7 / is bounded since Ti_ 7 is an averaging operator. 

(c) Stabp(/) = Stab p /(Ti_ 7 /) where p' := (1 _ P 7 p ■ 

Step 2. Next we truncate Ti_ 7 / to get 5 := (Ti_ 7 /)^ fc = Elsi^C 1 ~ 7) |5| /( 5 ')xs ) where 
A; := poly(-). Note that g has low-degree but it may no longer bounded. Nevertheless we 
can say that 

ll(Ti- 7 /) - g\\l = (1 - 7f |5| /(S) 2 < (1 - i) 2k y, f(S) 2 < 7, 

|5|>fc |S|>Jfc 

and since g is bounded this implies E[sqdist[_ 1)1 ](^(Xi, . . . , X n ))] < 7. For the same reason, 
Stab p '(Ti_ 7 /) < Stabp'(g) + 7. Here sqdist[_ 11 ] : R — > R-° is the function that gives the 
squared distance to the interval [—1,1]; i.e. sqdist[_ 11 ](i) = if t € [—1,1], and (|i| — l) 2 
otherwise. 

Step 3. We apply the invariance principle (an extension of the Berry-Esseen theorem to 
low-degree multilinear polynomials, proved in the next section) to g and the test function 
sqdistr^ij to bound 

E[sqdist hljl] ( g (g))] < E[sqdist [ _ 1)1] ( 5 (X))] + poly(2 fc ,e) = 7 + poly(2 fe , e). 

We are omitting a few details here since the invariance principle requires test functions to 
have uniformly bounded 4 th derivatives, just like in Berry-Esseen, so we actually need a 
smooth approximation of sqdist^ 1 i. 

Step 4. Finally we consider g' : R n — > [—1,1], the truncation of g to the interval [—1,1]; i.e. 
g'(u) = g(u) if g(u) G [—1,1], and sgn(g(u)) otherwise. We have Stab p '(<?) = GStab p '(g) 
since g is multilinear, and GStab p '(</) < 1 — — arccos(p') by Borell's theorem (again we are 
eliding some details here since g' may not satisfy E [</(£/)] = 0). It remains to argue that 
GStabp'(g) ~ GStab p '((/), and for this we need to define the Ornstein-Uhlenbeck operator 
Up, the Gaussian analogue of the noise operator T p : 
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Definition 63 (Ornstein-Uhlenbeck) Let f : R n — > R and p G [—1,1]. The Ornstein- 
Uhlenbeck operator U p acts of f as follows: (U p f)(x) := E[/(p • x + \J\ — p 2 ■ Q)], where Q 
is a standard n- dimensional Gaussian. 

With this definition, we may express Gaussian noise stability as GStab p (/) = E[f(Q)U p (Q)\ 
and compute 

IGStab^-GStaVG/)! = \E[g ■ U p g - g' ■ U p g'}\ 

< | B[g ■ U p g - g' ■ U p g}\ + \ E[g' ■ U p g - g' ■ U p g'}\ 

= \E[(g-g')-U p g]\ + \E[(g-g')-U p g']\ (12) 

= (E[( 5 - 5 ') 2 ]) 1/2 (E[(C/ p5 ) 2 ]) 1/2 

+ (E[( 5 - 5 ') 2 ]) 1/2 (E[(^') 2 ]) 1/2 (13) 

< 2(B[(g-g') 2 }) 1/2 . (14) 

Here (12) holds since E[</ • U p g] = E[U p g' ■ g], (13) is an application of Cauchy-Schwarz, 
and (14) uses the fact that U p is a contraction on 1? . Finally we note that E[(g — g') 2 ] is 
simply E[sqdistr 1 i](g)] and the proof is complete. 



5.3 The invar iance principle 

In this section we prove (a special case of) the Mossel-O'Donnell-Olezkiewicz invariance 
principle [MOO10] for multilinear polynomials with low influences and bounded degree; 
in full generality the principle states that the distribution of such polynomials is essen- 
tially invariant for all product spaces. The crux of the proof is a low-degree analogue of 
Proposition 55; once again we proceed by a hybrid argument, showing that a small error is 
introduced whenever we replace a Rademacher random variable with a standard Gaussian. 
This is sometimes known as the Lindeberg replacement trick, first appearing in Lindeberg's 
proof of the central limit theorem [Lin22]. There has been other work generalizing Linde- 
berg's argument to the non-linear case [Rot 75, Rot 79, Cha06], but these results either yield 
weaker error bounds or require stronger conditions (e.g. worst-case influences rather than 
average-case) . 

Proposition 64 (hybrid argument) Let Q be a degree-d multilinear polynomial Q(u) = 
H\s\<d c sWi<zs u i an d assume: 

1. The coefficients c$ G R are normalized to satisfy ^5-^0 c| = 1. 

2. We write n to denote lnii(Q) = YlsBi^S> an d ^ T = max ie[n] Infj(Q). 

3. ij) : R — > R is a function satisfying \ip^(x)\ < C for all x G R. 

4- X = Q(Xi, . . . , X n ) and Y = Q(Q\, . . . , Q n ) where Xx, . . . , X n are independent Rademach- 
er s and Qi, . . . ,Q n are independent standard Gaussians. 

Then I E[ip(X)) - E[^(Y)]| < d • 9 d • C ■ r. 
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Proof. We first define a sequence of hybrid random variables that interpolate between X and 
Y. For each i = 0, . . . , n we define the random variable Zj = Q(Qi, . . . , Qi,Xi+i, . . . , X n ), 
and note that Zo = X and Z n = Y. As before, it suffices to prove 

|E[^(Z f _i)]-E[^(Zi)]| <C-9 d -r? (15) 

for all i £ [n]. Note that the overall claim follows from the above by telescoping, the triangle 
inequality, and the fact that 

n n n 

E^-E-* = -EE4 = - E \s\-4<T- d . 

i=l i=l i=l S3i |S|<d 

Here in the final inequality we have used our assumption that the coefficients are normalized 
to satisfy Yls^Q c l = Var(X) = Var(Y) = 1. It remains to prove (15). Fix i G [n] and 
first express Q(ui, . . . ,u n ) as the sum of two polynomials R and S, the former comprising 
all terms not containing Uj, and the latter the rest with itj factored out. That is, 

Q(ui, ...,«„) = i?(-ui, • • • ,nj_i,-Uj + i, . . . ,u n ) + u» • S(ui, . . . ,Ui-i,Ui + i, . . . ,u n ), 

where R has degree at most d, and S at most d — 1 (note that if = 1 then S* is simply the 
coefficient a, of itj in the linear polynomial L). Next define the random variables 

R = . . . , Gi-i,Xi + i, . . . ,X n ) 

S = S(Gi,...,Qi-i,Xi+i,...,X n ), 

and note that Zj_i = R + X; • S and Z* = R + Qi ■ S. We bound | E[^(R + X t ■ S)] - 
E[^(R + Qi ■ S)]| by considering their Taylor expansions: 



lE^Zi-i^-Eh^Z 



E ty(R) + (X< • S)^ (1) (R) + • S)V (2) (R) + • S) 3 V (3) (R) + error! 
- E [V(R) + (ft • S)^ (1) (R) + \{Qi ■ S)V (2) (R) + \{Qi ■ S) 3 V (3) (R) + error 2 

Note that the first four terms cancel out since X{ and Qi are independent of S and R, and 
the random variables X{ and Qi have matching first, second and third moments. Applying 
the bounds on the error terms given by Taylor's theorem, we see that 

| E [error! -error 2 ] | < i • CE[Xf ■ S 4 ] + i • CE[Q? ■ S 4 ] (16) 
= ^.CE[S 4 ] + 2 |.CE[S 4 ] (17) 
< C-E[S 4 ]. 

Here (16) uses our assumption that tp^ is uniformly bounded by C, and (17) is by inde- 
pendence along with the fact that E[Qf] = 3. Next, since S is a degree-ci polynomial, we 
may apply Bonami's lemma (Theorem 43) to get C • E[S 4 ] < C ■ 9 d • E[S 2 ] 2 and it remains 
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to argue that E[S 2 ] upper bounded by = Inf j(Q) = Yls3i c s- Indeed, recall that S is the 
polynomial comprising all terms of Q containing U{ with U{ factored out, and so 



where each Yj is either a Rademacher or standard Gaussian random variable, depending on 
whether j < i. We have shown that | E[^(Zj_i)] — E[^>(Zj)]| < C ■ 9 d ■ rf, and the proof is 



In the proof of the Berry-Esseen theorem we needed the fact that the anti-concentration of a 
standard Gaussian Q at radius e is 0(e). That is, for all t S M we have Pr[|£/— 1\ < e] = 0(e). 
The low-degree analogue of this small ball probability is given by the following proposition 
due to Carbery and Wright [CW01, Kanll]. 

Lemma 65 (Carbery- Wright) There exists a universal constant C such that the follow- 
ing holds. Let Q be a multilinear polynomial of degree d over Qi, . . . ,Q n , a sequence of 
independent standard Gaussians, and e > 0. Then 



In particular, if the coefficients of Q are normalized to satisfy Var (Q) = 1 then for alltEM 
ande>0 we have Pr[|Q(0i, . . . , Q n ) - t\ < e] = O^e 1 ^). 

Lemma 66 (smooth test functions) Let r > 2 be an integer. There exists a constant 
B r such that for all < A < 1/2 and (£R there exists a function A\t : M — >• M satisfying 
the following: 

i is smooth and \\ (A^f) ^ jjoo 

2. Ax, t (x) = 1 for all x <t- 2X. 

3. A x ,t (x) e [0, 1] for all x £ (t - 2 A, t + 2 A) . 

4. A x ,t(x) = for all x>t + 2A. 

We are now ready to prove the invariance principle. 

Theorem 67 (invariance) Let Q(ui, . . . , u n ) = X^5c[n] c s Ylies Ui ^ e a degree-d multilin- 
ear polynomial and assume 

1. The coefficients cs 6 M are normalized to satisfy X^s^0 c 5 = 1- 

2. We write t% to denote Infj(Q) = YIsbi c "s> an d ^ T = max ie[n] Infi(Q). 

3. X = Q(Xi, . . . , X n ) andY = Q(Gi, ■ ■ ■ ,Gn) where X\, . . . , X n are independent Rademach- 
ers and Q\, ... ,Q n are independent standard Gaussians. 



Then for all teR, \ Pr[Q(Xi, . . . ,X n ) < t]-Vr[Q(g x , . . . , Q n ) <t]\ = O(d-(10 d -r) 1 /( 4d + 1 )) . 




complete. 



Pt[\Q(G u ... 



Qn)\<e]<C-d.{e/\\Q(G)\\ 2 ) 



l/d 
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Proof. Let t£l. We will write Q(X) to denote Q(X U . . .,X n ), Q{Q) for Q(G 1 , . . .,g n ), 
and ij) for A^ t+2 A ( r = 4), for some value of A > to be determined later. Recall that 
if)(x) = 1 for all x > (t + 2A) - 2A = t and so Pt[Q(X) < t] < E[tp(Q(X))]. Since V {4) is 
uniformly bounded by 0(1/ A 4 ), we apply Proposition 64 to get that 

E[ii>(Q(X))] < E[^(Q(g))]+O(10 d -T-A- 4 ). 

< Pr[Q(0) <t + 4A] + O(10 d -r-A- 4 ). (18) 

= Pr[Q(0) <t] + Pr[Q(0) e(t,t + 4A)] + O(10 d • r • A" 4 ) 

= Pr[Q(G) <t] + O(d-(4A) 1 / d ) + O(10 d -r-A- 4 ). (19) 

Here (18) is again by the properties of tp, this time using the fact that ip(x) = for all 
x > (t + 2A) + 2A, and (19) is by Carbery- Wright. Choosing A = (10 d • T ) d /^ d+1 \ we have 
shown that 

E[ip(Q(X))} <Pr[Q{g) <t} + O(d-(10 d -r) 1 /^ d +^). 

A symmetric argument establishes the analogous lower bound on E[^(Q(X))], and this 
completes the proof. ■ 
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6 Testing dictators and UGC-hardness 



Saturday, 3rd March 2012 
Guest lecture by Per Austrin 

Definition 68 (unique label cover) Let L be a positive integer. An instance \& of the 
L-Unique-Label-Cover problem is a graph G = (V, E) where each edge e € E has an 
associated constraint that is a permutation n e : [L] — >■ [L] . A labelling of ^ is a assignment 
to the vertices I : V — > [L]. We say that an edge (x,y) is satisfied by I if £(x) = ir e (£(y)), 
and the value of £ is the fraction of edges satisfied by I. The optimum of^, denoted opt(^), 
is the maximum value of the optimal assignment I. 

The Unique-Label-Cover problem is a special case of the Label-Cover problem where 
the constraints 7r e : [L] — > [L] are not required to be permutations. In particular, in the 
Unique-Label-Cover problem assigning a label to a vertex necessarily determines the 
labels of all its neighbors, whereas this is not the case for the Label-Cover problem. 
Consequently, for Unique- Label- Cover the task of deciding whether there is an assign- 
ment that satisfies all the edges (i.e. distinguishing opt(^) = 1 versus opt(^) < 1) is 
easy: assume a label for a vertex v and deduce the labels for the remaining vertices in a 
breadth-first fashion. If there is a conflict at some vertex we choose another label for v and 
repeat the same process. If no consistent labeling can be found after iterating through all 
L possible labels for v then opt(^) < 1; otherwise opt(^) = 1. This is in sharp contrast to 
the situation for Label-Cover: it is known that for every e > there is an L such that it 
is NP-hard to distinguish between opt( x I / ) = 1 versus opt('I') < e where is an instance of 
L-Label-Cover; we sometimes refer to this as the (1, e)-hardness of Label-Cover. 

The Unique Games Conjecture of S. Khot [Kho02] asserts that the Unique-Label-Cover 
problem is nevertheless very hard to approximate as soon as we move to almost-satisfiable 
instances. 

Conjecture 69 (unique games) For every e > there exists an L such that the it is 
NP-hard to distinguish between opt(^/) > 1 — e versus opt( x I') < e, where is an instance 
of L-Unique-Label-Cover. Equivalently, for every e > there exists an L such that the 
L-Unique-Label-Cover problem is (1 — e,e)-hard. 

Recent work of S. Arora, B. Barak and D. Steurer [ABS10] gives an algorithm for Unique- 
Label-Cover running in time exp(n poly ( £ )). 

Today we will prove the following theorem showing how explicit (c, s) dictator versus quasir- 
andom tests yield Unique Games-based hardness results for certain constraint satisfaction 
problems [Kho02, KR03, KKMO07, Aus08]: 

Theorem 70 Suppose we have a (c, s) dictator versus quasirandom using predicates from 
a set T. For every L there exist a polynomial time reduction R from L-Unique-Label- 
Cover to MAX-CSP(T) such that for every instance ^ of L-Unique-Label-Cover and 
every e > 0, there exists a 5 > satisfying 
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1. (Completeness) //opt(^) > 1 - 5 then opt(i?(*)) >c — e. 

2. (Soundness) //opt(^) < 5 then opt(R(^>)) < s + e. 

First, a small catch: the dictator versus quasirandom test have to work not only for 
boolean functions but also for bounded functions / : {— 1,1}" —> [—1,1]. We may view 
any predicate <j> : {-l,l} fc ->■ {0,1} as <f>* : [—1,1]* ->■ [0,1], where <t>*(yi, ■ ■ ■ ,Vk) ■= 
E[0(a?i, . . . ,Xk)], the expectation taken with respect to {—1, l}-valued random variables 
xi satisfying E[sci] = y«. It is easy to check that 4>*(x) = <j>(x) for all x G {—1, l} fc , and 
in fact we have 4>*(yx, ■ ■ ■ , yk) = Z^scffc] Ylies Vi- With this observation any tester 
for boolean functions using predicate <j) can be extended to one for all bounded func- 
tions: instead of accepting iff 4>(f(xi), . . . , /(xfc)) = 1, the tester accepts with probability 
0*(/(xi),...,/(x fc ))€[O,l]. 

The reduction from Unique-Label-Cover to MAX-CSP 

With this caveat out of the way, we are now ready to describe the reduction R: 

Let $ be an instance of L-Unique-Label-Cover defined over a graph 
G = (V, E). Suppose we have a (c, s) dictator versus quasirandom test 
for functions {—1, 1} L — > [—1, 1] using fc-ary predicates from a set T. 
Consider the following instance R(^f) of MAX-CSP(T): 

Variables. There will be \V\ ■ 2 variables: for each «eFwe define 
2 L boolean variables {Z U)X : x G {— 1,1}^}. 

Constraints. A random constraint will be sampled as follows: 

1. Pick u G V uniformly. 

2. Pick k neighbors v±, . . . , v k G N(u) of u uniformly independently. 

3. Define f U)V . := f Vi (x o ir UjVi ). 

4. Pick Xi, . . . ,Xk G { — 1, 1} L according to the distribution over k- 
tuples induced by the tester, and set y^ := f u>Vi (xj). 

5. Return the constraint (f>(yi, ■ ■ ■ ,yk) = 1. 



In step 3, ~k u ,v % is the permutation associated with the edge (u,Vi) G E, and x o ir UtVi is 
the string x with its coordinates permuted according to ir UtVi . For each u G V, it will be 
convenient for us to think of an assignment to the corresponding 2 L variables of the CSP as a 
boolean function f u : { — 1, 1} L — > { — 1, 1}, where Z UjX <— f u (x); an assignment to all |V| - 2 L 
variables can then be defined as a set of | V| boolean functions {f u : {—1, 1} L — > { — 1, l}} u ev- 
We will assume that G is regular; this is without loss of generality by a result of Khot and 
Regev [KR03]. 
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Completeness 

Suppose opt(*I>) > 1 — 5, and let £ : V — >■ [L] be a labelling achieving this. Our goal is to 
exhibit an assignment to the variables of the CSP R(^f) that satisfies at least a c — e fraction 
of constraints. We consider the fraction satisfied by the assignment f u (x) := DICT^ u )(x) = 
x^y) for all u £ V. 

Consider an edge (u,Vi) G E satisfied by £ (i.e. l(u) = 7T UjVi (£(vi)), and note that 

fu,Vi(x) = f Vi (x o 7T U}V .) = (X0 1fu,Vi) i{vi) = x n u , n (e(v t )) = X£(u) = DICT^( M ). 

Therefore if the k edges incident to u (chosen in step 2 above) are all satisfied by i then 
• • • , Vh)\ is the probability that the test accepts DICT^ U ), at least c by our assump- 
tion. Since G is regular and i satisfies a 1 — 5 fraction of all edges, the probability that k 
uniformly random edges incident to a random u G V is satisfied by £ is at least 1 — k5, and 
so we have opt(i?( v I / )) > c • (1 — kS) > c — e (for sufficiently small 8). 

Soundness 

We will assume that opt(i?(^)) > s + e and prove opt(^) = fl £ (l). We first express the 
fraction of satisfied constraints as 

s + e< E [<t)(f UjVl (xi),...J UiVk (x j ))]= E [^(Elfu^X!)], . . . ,~E[f UiVk (x k )])] 

X\ . -iXfc 

For each u G V, let g u {—l, 1} L — > [—1, 1] be the function defined by g u (x) := E veN ^ [f u ,v{x)], 
and so the above can be rewritten as E MjXlj ... jXn [(p(g u (xi), . . . , g u {xk))] > s + e. By an aver- 
aging argument, at least an | fraction of all u G V satisfy Ft Xl) ^^ Xk [(p(g u (xi), . . . ,g u (xk))] > 
s + 1 ; call these it G V "good" . By the soundness condition of a dictator versus quasirandom 
test, it follows that if u is good g u cannot be (7, 7)-quasirandom for some 7 = f2 £ (l); in 
particular, there must exist at least one i 6 [L] such that Inf^ 1 ^ (g u ) > 7. 

Let J u = {i G [L] : Inf. 1 (g u ) > 7} ; and note that | J u \ < 4y by Proposition 51. We claim 
that every i G J u satisfies Pr v&N ^ [inf^^ (/,,) > 2] > 2 ( once again, at least one such i 
exists if u is good). To see this, it suffices to check that 

Jiu) = ? M^fc)] > Inff- 7) (E [JQ) = Inff^) > 7; 

the claim then follows by Markov's inequality. For every u £ V we also define J' u = {j G 
[L] : hnf ~ 7) (/ u ) > 2}, noting that |4| < 

Consider the following labelling £ : V — > [L]: for each u G V, if J u U is non-empty we 
assign u a uniformly random label in J U U J' u , otherwise we assign u an arbitrary label. It 
remains to prove that I satisfies an S7 e (l) fraction of edges. We have shown that for at 
least an | • ^ = £Z fraction of edges (u, v ) there exists an % G [L] such that i G J U U J' u 
and 7T~J,(i) G J v U J',. Conditioned on the existence of such an i, the edge is satisfied if 
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£{u) = i and £{v) = ir u l(i) (recall that (u,v) is satisfied if £{u) = 7r UtV (£(v))), and this 
happens with probability at least (I J u U J' U \\J V U JLl)' 1 > (-tt + "%) 2 = "a - - We conclude 
that opt(^) = fi(e7 • 7 4 ) = fi( e 7 5 ) = fi e (l), and the proof is complete. 

For further details see [Aus08]. 
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